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Abstract 

Introduction: Infection is a major cause of morbidity and mortality in patients with rheumatoid arthritis (RA). The 
objective of this study was to perform a systematic review and meta-analysis of the effect of glucocorticoid (GC) 
therapy on the risk of infection in patients with RA. 

Methods: A systematic review was conducted by using MEDLINE, EMBASE, CINAHL, and the Cochrane Central 
Register of Controlled Trials database to January 2010 to identify studies among populations of patients with RA 
that reported a comparison of infection incidence between patients treated with GC therapy and patients not 
exposed to GC therapy. 

Results: In total, 21 randomised controlled trials (RCTs) and 42 observational studies were included. In the RCTs, GC 
therapy was not associated with a risk of infection (relative risk (RR), 0.97 (95% CI, 0.69, 1.36)). Small numbers of 
events in the RCTs meant that a clinically important increased or decreased risk could not be ruled out. The 
observational studies generated a RR of 1.67 (1.49, 1.87), although significant heterogeneity was present. The 
increased risk (and heterogeneity) persisted when analyses were stratified by varying definitions of exposure, 
outcome, and adjustment for confounders. A positive dose-response effect was seen. 

Conclusions: Whereas observational studies suggested an increased risk of infection with GC therapy, RCTs 
suggested no increased risk. Inconsistent reporting of safety outcomes in the RCTs, as well as marked 
heterogeneity, probable residual confounding, and publication bias in the observational studies, limits the 
opportunity for a definitive conclusion. Clinicians should remain vigilant for infection in patients with RA treated 
with GC therapy. 



Introduction 

Infection is a major cause of morbidity and mortality in 
patients with rheumatoid arthritis (RA) [1,2]. The 
increased incidence has been attributed to the disease 
itself, associated factors such as smoking and immuno- 
suppressive therapy, or a combination of these. Gluco- 
corticoid (GC) therapy, still widely used in the 
treatment of RA [3], is thought to be associated with an 
increased infection risk as well as other well-established 
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adverse effects [4], GCs are known to impair phagocyte 
function and suppress cell-mediated immunity, thereby 
plausibly increasing the risk of infection [5]. However, 
the extent to which GC therapy contributes to the 
observed increased risk in RA is not clear. 

Surprisingly, despite six decades of clinical experience 
[6], no good summary estimates of infectious risk asso- 
ciated with GC therapy in RA populations exist. Sys- 
tematic reviews have been performed to address the 
efficacy of GC therapy [7], as well as multiple safety out- 
comes from RCTs in RA populations [8,9]. Reviews of 
safety issues from observational studies tend to be nar- 
rative (rather than systematic) reviews, despite the 
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recognition that observational data must complement 
RCT data when assessing the harms of drug treatments 
[10]. No systematic reviews or meta-analyses exist that 
focus on the infection risk associated with GC therapy 
by combining evidence from RCTs and observational 
studies. 

Our primary aim was to perform a systematic litera- 
ture review and meta-analysis (where appropriate) of 
RCTs and observational studies to assess the association 
between systemic GC therapy and the risk of infection 
in patients with RA, compared with patients with RA 
not exposed to GC therapy. Secondary aims were to 
examine the influence of study design, definition of GC 
exposure, and type of infection. 

Materials and methods 

Search strategy 

A search was conducted in MEDLINE, EMBASE, 
CINAHL, and the Cochrane Central Register of Con- 
trolled Trials (Clinical Trials; CENTRAL) database to 
January 2010 to identify studies among populations of 
patients with RA that reported a comparison of infec- 
tion incidence between patients treated with GC therapy 
and patients not exposed to GC therapy. 

Published studies were identified by using separate 
search strategies for RCTs and observational studies. 
The full search strategy can be found in Additional file 
1. In brief, all GC RCTs for RA were sought. Observa- 
tional studies were identified by using the broad key- 
word areas of "rheumatoid arthritis," "infection," and 
"antirheumatic therapy," limiting the search to epide- 
miologic studies. An initial search strategy of "GC ther- 
apy," as opposed to "antirheumatic therapy," missed 
many studies in which the association between GCs and 
infections was reported, but in which GC therapy was 
not included in the title, abstract, or as a key word. 
Exposure was limited to systemic GC therapy: studies 
that reported only intra-articular steroids were excluded. 
We considered only articles published in English 
because of the need to screen large numbers of publica- 
tions by using the complete manuscript. Hand searching 
of reference lists from obtained articles and selected 
review articles also was performed. Abstract-only publi- 
cations and unpublished studies were not considered. 
No authors were contacted for additional information. 

Study selection 

The first selection, based on title and abstract, was done 
by one reviewer (WGD). Studies conducted exclusively 
in non-RA populations were excluded. Studies with 
designs other than RCTs, case-control, or cohort studies 
were excluded at this stage, as were studies of nonsyste- 
mic GC therapy. RCTs that did not randomize GC ther- 
apy were excluded. Case-control studies defined by any 



outcome except infection also were excluded. The full 
manuscripts of all remaining articles were obtained. Any 
uncertainty during initial screening led to retention of 
the article for eligibility assessment. 

Eligibility assessment was then performed indepen- 
dently by two reviewers (WGD and MH), applying the 
following final study-inclusion criteria. For RCTs: (1) 
study population of patients with RA or undifferentiated 
inflammatory polyarthritis, (2) exposure to systemic GC 
therapy (that is, excluding intra-articular and tendon- 
sheath injections) in one arm and nonexposure in a 
further study arm (that is, in which the only major dif- 
ference between the arms was the use of GC), and (3) 
reporting of infection numbers or rates in the two rele- 
vant study arms. If studies reported additional arms 
examining the effect of an alternative active treatment, 
data were analyzed only for the arms comparing GC 
therapy with no-GC exposure. If studies were explicit in 
describing the methods by which they captured infec- 
tion, nonreporting of infection within the results was 
assumed to represent no infections in either group. 
Absent reporting of infection that was in any way 
ambiguous led to exclusion of the study. Studies that 
reported only adverse events leading to drug disconti- 
nuation were included, although grouped separately. For 
observational studies: (1) assessment of infection risk in 
a population (or subpopulation) of patients with RA or 
undifferentiated inflammatory polyarthritis, (2) use of a 
cohort or case-control design to conduct data analysis, 
and (3) provision of a relative-risk or rate-ratio estimate 
for the association between systemic GC therapy and 
infection with a corresponding 95% confidence interval 
(or sufficient data to calculate this) were required. 
These criteria allowed inclusion of open-label extension 
studies if they analyzed infection risk with GC therapy 
compared with no-GC therapy. Helicobacter pylori 
infection was excluded. Disagreements were resolved by 
discussion. 

Data extraction and meta-analysis 

Data on the number of infections or the estimated rela- 
tive risks were extracted by one reviewer (WGD), along 
with characteristics of the studies. Extracted data were 
cross-checked against notes made by both reviewers 
during the eligibility assessment, with resolution by dis- 
cussion in the few instances of disagreement. Informa- 
tion on categorization of GC exposure and types of 
infection was collected. 

Meta-analysis was conducted for RCTs and observa- 
tional studies separately. RCT meta-analysis was per- 
formed initially including all studies, followed by a series 
of a priori sensitivity analyses. In the main analysis, all 
GC-treated arms were combined. Because of the low 
number of events and the sensitivity of the default 
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weighting (the inverse of the variance of the logarithm of 
the odds ratio) to the definition of infection (for example, 
serious or not serious), alternative weighting was per- 
formed by number of patients, then by estimated person 
years of follow-up. To avoid excluding studies in which 
zero events were found in both arms, a sensitivity analysis 
was performed after adding 0.5 to all cells of the 2x2 
table. Additional sensitivity analyses included limiting 
studies to GC doses of < 10 mg prednisolone equivalent 
(PEQ), limiting outcomes to serious infections, and 
excluding studies reporting only events leading to study 
withdrawal. If studies reported more than one type of 
infection, sensitivity analyses were performed to examine 
the influence of using alternative definitions. Different 
analysis methods were considered, given the statistical 
challenge of rare events [11], including the Mantel- 
Haenszel odds ratio (with and without zero-cell correc- 
tion), inverse variance, and weighting by study size. 

A meta-analysis of all observational studies was per- 
formed, stratified by study design (cohort and case con- 
trol). If several strata of exposure (for example, 0 to 5, 5 
to 10, and > 10 mg PEQ) were presented in the absence 
of an overall effect measure, one reported category was 
selected for the meta-analysis. If three categories were 
reported, the middle category was chosen. If only two 
categories were reported, the category with the larger 
number of patients or person time was selected. Ran- 
dom-effects models were used to account for between- 
study heterogeneity by using the DerSimonian and Laird 
method [12]. Similarity between the risk ratio and the 
odds ratio was assumed because infectious events were 
considered rare. Again, several a priori sensitivity ana- 
lyses were conducted. With respect to exposure, dose- 
specific analyses were performed, as well as limiting ana- 
lysis to studies considering only current GC exposure. 
Adjusted and unadjusted analyses were considered sepa- 
rately, as well as exploration of the impact of different 
components of multivariate adjustment (age and sex, dis- 
ease severity, disease duration, comorbidity, and other 
RA therapies). Several specific outcomes were considered 
separately, including all-site serious infections, lower- 
respiratory-tract infections, tuberculosis, herpes zoster, 
and postoperative infections. In response to reviewers' 
comments, we also performed a sensitivity analysis of 
serious infections reported in prospective studies. 

Funnel plots were created to examine the potential for 
small study effects [13]. Statistical heterogeneity was 
assessed by using the Cochrane I 2 statistic [14], in 
which I 2 > 50% represents substantial heterogeneity. All 
analysis was conducted by using Stata/SE version 11. 

Results 

The 1,568 records were identified through parallel data- 
base searching (Figure 1). The results were loaded into 



an electronic bibliographic management system (End- 
Note). After removal of duplicates, 1,309 studies were 
identified and screened by one reviewer (WGD). The 
430 full-text articles were then assessed for eligibility by 
two reviewers (WGD and MH). The 21 RCTs [15-35] 
and 42 observational studies [36-77] (33 cohort, nine 
case-control) were included in the analysis. Details of 
the studies are described in Tables 1 and AF2 (Addi- 
tional file 2). 

There were 1,963 patients included in the 21 RCTs, 
and 526,629, in the 42 observational studies. The mean 
study duration was 41 weeks for the RCTs, and the 
median follow-up time was 1.93 person years per patient 
for the 30 observational cohort studies for which follow- 
up time was available. 

Main results 
RCTs 

In 1,026 GC-treated patients, 59 (5.8%) infections were 
found compared with 51 infections in 937 (5.4%) non- 
GC patients. Ten of 21 studies had no reported infec- 
tions in either arm, and four further studies had no 
infections in one of the two arms. The estimated relative 
risk of infection associated with GC therapy was 0.97 
(0.69, 1.36) (Figure 2). No evidence of statistical hetero- 
geneity was present among the included trials (I = 0.0). 
Observational studies 

Systemic GC therapy was associated with an increased 
risk of infections in observational studies (RR, 1.67 
(1.49, 1.87)). Risk estimates differed by study design, 
with cohort studies generating an RR of 1.55 (1.35, 1.79) 
and case-control studies, 1.95 (1.61, 2.36) (Table 2; Fig- 
ure 3). However, evidence was noted of substantial sta- 
tistical heterogeneity (I 2 = 76% for observational studies 
overall, 71% for cohort studies, and 79% for case-control 
studies). 

Sensitivity analyses 
RCTs 

Sensitivity analyses using alternative weighting, different 
statistical methods of dealing with low event numbers, 
limiting to studies with a placebo rather than active com- 
parator, and limiting to doses < 10 mg PEQ led to no 
major change in the results (Additional file 3). Too few 
studies reported exclusively serious infections, and too 
few events in those studies, warranted a robust meta-ana- 
lysis [18-20]. Studies considered to report predominantly 
nonserious infection generated an RR of 1.05 (0.89, 1.24). 
One study included methotrexate in addition to GC ther- 
apy in the treatment arm (15). Exclusion of this study 
generated an RR of 0.83 (0.57, 1.21). 
Observational studies 

Stratification by dose category showed a positive dose- 
response effect. Studies with average doses of < 5 mg 
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Identification 
(WGD) 



Abstract 
Screening 
(WGD) 



Full-text 
eligibility 
(WGD+MH) 



1562 studies identified 
through database searching 



6 additional studies identified 
through citation index searching 



259 duplicates 



1309 unique studies identified 







879 excluded as 

-Non-RA populations 

-Not RCT7 cohort/ case-control 

-RCTs not randomised to GC therapy 

-Non-systemic GC therapy 


430 full-text articles 
assessed for eligibility 










367 excluded 

-Same criteria as above, or 

-No estimate of infection risk with GCs 



63 studies included in the review 
-21 RCTs 

-42 observational studies (33 cohort, 9 case-control) 



Figure 1 Flow chart demonstrating study selection. GC, glucocorticoid; RA, rheumatoid arthritis; RCT, randomized controlled trial. 



PEQ generated an RR 1.37 (1.18, 1.58) compared with 
an RR of 1.93 (1.67, 2.23) for 5- to 10-mg PEQ. Only 
one study reported an RR for doses between 10 and 20 
mg PEQ (RR, 2.97 (1.89, 4.67)) [68]. Limiting analyses to 
dose categories above a certain threshold also led to a 
dose response: RR, 2.46 (2.08, 2.92) for dose categories 

> 5 mg PEQ, RR 2.97 (2.39, 3.69) for dose categories > 
10 mg PEQ, and RR 4.30 (3.16, 5.84) for dose categories 

> 20 mg PEQ. Doses of < 10 mg PEQ had a pooled esti- 
mate of 1.61 (1.42, 1.84), higher than the risk for studies 
of dosages < 5 mg PEQ. 

Adjustment for age and sex led to an RR of 1.78 (1.58, 
2.01) compared with no adjustment (RR 1.32 (0.97, 1.80)) 
(Table 2). Adjustment for direct measures of disease 
severity did not lead to much change in the risk estimates 
when compared with estimates not adjusted for direct 
measures of disease severity. Disease duration also had 
little impact on the RR. Adjustment for co-morbidity and 
for other RA therapies (disease-modifying antirheumatic 
drugs (DMARDs) and/or biologies) led to estimates 
~40% higher than the unadjusted estimates. Limiting 



analysis to studies defining GC exposure as "current use" 
generated an RR of 1.70 (1.47, 1.97) (Table 2). 

GC therapy was associated with an increased risk of 
all-site serious infection (RR, 1.89 (1.60, 2.24)), lower- 
respiratory-tract infections (RR, 2.10 (1.52, 2.91)), tuber- 
culosis (RR, 1.74 (1.09, 2.76)), herpes zoster (RR, 1.74 
(1.28, 2.36)) and, to a lesser extent, postoperative infec- 
tions (RR, 1.38 (1.02, 1.86)). The risk of serious infec- 
tions persisted when analysis was restricted to 
prospective studies (RR, 1.70 (1.14, 2.55)). Even with 
stratification by outcome, notable statistical heterogene- 
ity remained across outcomes (I 2 = 82%, 51%, 28%, 86% 
and 0, respectively). 

Publication bias 

The funnel plot of RCTs (Figure 3a) was roughly sym- 
metrical, with all studies falling within the 95% CI. The 
funnel plot for observational studies was less symmetri- 
cal and had more outliers (Figure 3b). The Egger test 
for publication bias was nonsignificant for both the 
RCTs (P = 0.936) and observational studies (P = 0.174 
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Table 1 Summary of GC RCTs reporting infection outcomes 



First author 
and year 



Country Setting/Population Arms of RCT (n) 



Duration 
of study 



Type of 
outcome 



Result 



Boers, 1 997 [1 5] 



Chamberlain, 
1976 [16] 



The 

Netherlands 
and Belgium 



UK 



Choy, 2005 [1 7] UK 



Choy, 2008 [18] UK 



Durez, 2007 [19] Belgium 



Durez, 2004 [20] Belgium 



Gerlag, 2004 
[21] 



Heytman, 1994 
[22] 



The 

Netherlands 



Australia 



Jasani, 1968 [23] UK 



Kirwan, 2004 
[24] 



Liebling, 1981 
[25] 



Murthy, 1978 
[26] 

Sheldon, 2003 
[27] 

Van Everdingen, 
2002 [28] 

Wassenberg, 
2005 [29] 



Belgium, 
Sweden, UK 



US 



Ui 



UK 
The 

Netherlands 

Germany/ 

Austria/ 

Switzerland 



155 early RA 
patients from 8 
centers 



49 adult RA patients 
from single center 



91 patients with 
established RA with 
incomplete response 
to DMARDs. 
Multicenter study 

467 patients within 
2 years of diagnosis 
from 42 centers 



44 patients with 
early RA 



27 patients with 
active RA despite 
MTX 

21 patients with 
active RA despite 
DMARDs 

60 patients with 
active RA previously 
treated with NSAIDs 

9 patients with 
erosive RA 



143 patients with 
active RA 



1 0 patients with 
active RA 



24 patients with > 
30 minutes morning 
stiffness 

26 patients with 
active RA 

81 patients with 
active, previously 
untreated RA 

192 patients with 
active RA, disease 
duration < 2 years 



28 weeks 



2- 3.5 
years 



2 years 



46 weeks 



24 weeks 



1 2 weeks 



n/a 



2 years n/a 



Combination therapy - step- 
down prednisolone from 60 
mg, step-down MTX and 
SSZ (76) vs SSZ 
monotherapy (79) 

5 mg prednisolone (20) vs 

3 mg prednisolone (10) vs 
0 mg prednisolone (19) 
Allowed concomitant gold 

Monthly 120-mg 
intramuscular depomedrone 
(48) vs placebo (43) 
Allowed usual DMARDs 

MTX (117) 

MTX + cyclosporin (119) 
MTX + step-down 
prednisolone (1 15) 
MTX + cyclosporin + 
prednisolone (1 16) 

MTX monotherapy (14) 
MTX + 1 g iv 
methylprednisolone 3 (15) 
MTX + infliximab 3 (15) 
Infusions weeks 0, 2, 6; then 
8 weekly 

MTX + 1 g iv MP week 0 
(15) 

MTX + infliximab weeks 0, 2, 
and 6 (12) 

60 mg prednisolone week 1, 2 weeks n/a 
then 40 mg prednisolone 
week 2 (10) 
Placebo (11) 

Gold plus either 1 g iv 
methylprednisolone weeks 0, 
4, and 8 (30) or placebo (30) 

4 x 1-week crossover study 
of ibuprofen 750 mg, aspirin 

5 g, prednisolone 15 mg, 
and lactose as placebo 

Budesonide, 3 mg (37), 
budesonide, 9 mg (36), 
prednisolone, 7.5 mg (39), 
placebo (31) 



Infections treated 
as outpatient 



a) All-site serious 
infections 

b) Respiratory 
tract infections 



a) Serious 
infection 

b) 'benign' 
infection 



12 infections in combination 
arm, 6 in SSZ monotherapy 
arm 



No infections 



No infections either arm 



a) 7, 3, 4, and 2 serious 
infections in the four 
respective arms 

b) 54, 51, 49, and 55 
respiratory tract infections in 
the four respective arms 

a) No serious infections in any 
arm 

b) 14, 12, and 12 benign 
infections in the three arms, 
respectively 



14 weeks Serious infections None in either arm 



All patient- 
reported side 
effects 



4 weeks n/a 



a) Respiratory 
infections 

b) Viral infections 



Crossover trial of monthly 1- 
g iv methylprednisolone vs 
placebo 

Indomethacin, 25 mg x 4 
(12), prednisolone, 5 mg (12) 

Budesonide (14) or placebo 
(12) plus usual DMARDs 

10-mg prednisolone (40), 
placebo (41) 

Gold or MTX plus either 5 
mg prednisolone (93) or 
placebo (96) 



12 

months (6 
months 
per arm) 

2 weeks 



n/a 



n/a 



4 weeks n/a 



2 years 



2 years 



Data reported on 
infections treated 
with antibiotics 

All adverse 
events collected, 
reported only if 
occurred in 3 or 
more patients 



1 skin infection in placebo arm 
only 



No infections reported 



No infections reported 



a) 7, 4, 6, and 1 respiratory 
infections in the 4 groups, 
respectively. 

b) 4, 1, 0, and 0 viral infections 
in the four groups, respectively 

4 infections on placebo, 2 on 
GC 



No infections reported 



2 cases of influenza (one from 
each group). 

17 infections in 40 patients in 
GC arm, 22 infections in 41 
patients in placebo arm 

Total 4/93 and 3/96 (Bronchitis 
in 3/93 prednisolone group, 0/ 
96 placebo group. Influenza in 
1/93 prednisolone group, 3/96 
placebo) 
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Table 1 Summary of GC RCTs reporting infection outcomes (Continued) 



Williams, 1982 


UK 


20 patients with 


1 -g iv methylpredisono one 


6 weeks 


"Serious side 


None reported 


R01 




active RA 


(10) or placebo (10) 




effects" 




U/nnn 1 QQn 

vvoriy, i yyu 


Australia 


40 patients with 


Gold plus either three pu ses 


24 weeks 


Patients 




1 injection-site infection in 






active RA previous y 


of 1 g intravenous 




interviewed for 


nlarpho nroiin 






treated with NSAIDs 


methylpredniso one weeks 0, 




a I possible side 










4, + 8 (20) or placebo (20) 




effects 




Capell, 2004 


UK 


167 patients with 


SSZ plus either 7 mg 


2 years 


Withdrawals due 


No discontinuations due to 


P-l 




active RA on no 


prednisolone (84) or placebo 




to side effects 


infection in either group 






DMARD therapy 


(83) 








Svensson, 2005 


Sweden 


250 patients with 


DMARD + prednisolone, 7.5 


2 years 


Adverse events 


1 abscess in non-prednisolone 


[33] 




active disease on 


mg (119), DMARD alone, 




leading to 


group. No infections leading to 






DMARD therapy 


open, no placebo (131) 




withdrawal 


discontinuation in 














predniso one group 


Van der Veen, 


The 


30 patients with 


Oral MTX plus either 


1 year 


Adverse events 


1 pneumonia in placebo 


1 993 [34] 


Netherlands 


active RA 


placebo (10) or 100 mg oral 




leading to 


group (at week 12) 








prednisolone days 1, 3, and 




discontinuation 










5 (10) or 1 g iv MP days 1, 3, 




of MTX 










and 5 (10) 








van 


The 


56 patients with 


Chloroquine, 100 mg/day 


2 years 


Withdrawal due 


No discontinuations due to 


Schaardenburg, 


Netherlands 


active RA aged > 60 


(28) (rescue with gold, then 




to adverse 


infections in either group 


1995 [35] 




previously treated 


SSZ allowed) vs 




advents 








with NSAIDs 


prednisolone 15 mg/day, 














tapered after 1 month (28) 









DMARD, disease-modifying antirheumatic drug; iv, intravenous; ivMP, intravenous methylprednisolone; MTX, methotrexate; NSAIDs: nonsteroidal anti- 
inflammatory drugs; RA, rheumatoid arthritis; SSZ, sulfasalazine. a lnfusions weeks 0, 2, 6; then 8 weekly. 



Study 

ID 


< 
















% 


GC safer 






GC harmful 


RR (95% CI) 


Treatment 


Control 


Weight 


Choy 0B (MTX +/- pred); serious 
Choy 08 (MTX + clclo +/- pred): serious 
Gerlag 












2.08 (0.82, 5.26) 
0.58(0.17, 1.93) 
0.68 (0.12, 4.02) 
0.36 (0.02, 8.03) 


12/76 
4/115 


6/79 
7/117 


11.01 




— — 




















2/115 


37119 


5 54 












OflO 


1/11 


2 68 


Kirwan : respiratory 

Liebling 

Sheldon 












4.71 (0.65, 33.98) 
0.50(0.12, 2.14) 
0.86 (0.06, 12.28) 


17/112 


1/31 


2.93 








— ■ 














2/10 
1/14 


4/10 
1/12 


7.48 
2.02 












Van Everdingen 
Wassenberg 












0.79 (0.50, 1.25) 
1.38 (0.32, 5.98) 
0.33 (0.01,7.72) 
0.37 (0.02, 8.92) 
0.17(0.01,3.94) 


17/40 
4/93 
0/20 
0/119 
0/20 


22/41 
3/96 
1/20 


40 66 












5 52 










2.81 










1/131 


2.67 














1/10 


3.66 


Chamberlain 












(Excluded) 


0/30 


0/19 


0 00 


Choy 05 












(Excluded) 


0/48 


0/43 


0.00 


Durez 07: serious 












(Excluded) 


0/15 


0/14 


0.00 


Durez 04 












(Excluded) 


0/15 


0/12 


0.00 


Heytman 












(Excluded) 


0/30 


0/30 


0.00 


Jasani 












(Excluded) 


0/9 


0/9 


0 00 


Murthy 












(Excluded) 


0/12 


0/12 


0.00 


Williams 












(Excluded) 


QV10 


0/10 


0.00 


Capell 












(Excluded) 


0/84 


0/83 


0.00 


van Schaardenburg 












(Excluded) 


0/28 


0/28 


0.00 


Overall (l-squared - 0.0%, p - 0-530) 




< 


> 






0.97(0.69, 1.36) 


59/1026 


51/937 


100.00 


1 


I 



.1 1 10 



Figure 2 Meta-analysis of infection risk in randomized controlled trials of systemic glucocorticoid therapy. 
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Table 2 Study design factors within observational studies and their influence on relative risk of infection associated 
with glucocorticoid therapy 



Number of studies 



Mean RR 



r statistic 



Ratio of RR 



Study design 

Cohort 

Case-control 
Definition of exposure 

Baseline 

Current (within 3/12) 
Recent (within 6/12) 
Ever 
Unclear 
Adjusted for age and sex 
No 
Yes 

Adjusted for disease severity 
No 

Adjusted for surrogate 
Adjusted for direct measurement 
Adjusted for disease duration 
No 
Yes 

Adjusted for comorbidity 
No 

Yes 

Adjusted for other RA therapies 
No 
Yes 



33 



5 

22 
7 
2 
6 

22 
19 

2-1 
10 

6 

33 
6 

22 
17 

22 
18 



1.55 (1.35, 1.79) 
1.95 (1.61, 2.36) 

1 .46 (0.87, 2.45) 
1.70 (1.47, 1.97) 

1.56 (1.24, 1.96) 
1.80 (1.29, 2.51) 
2.35 (1.27, 4.36) 

1 .32 (0.97, 1 .80) 
1.78 (1.58, 2.01) 

1.41 (1.14, 1.75) 
1.98 (1.68, 2.34) 
1.52 (1.17, 1.97) 

1.63 (1.41, 1.89) 
1.55 (1.20, 2.01) 

1 .30 (0.97, 1 .74) 
1.74 (1.55, 1.96) 

1 .28 (0.98, 1 .67) 
1.84 (1.62, 2.08) 



71.3% 
79.4% 

79.7% 
58.9% 
79.5% 
52.5% 
36.5% 

67.6% 
82.3% 

71.3% 
78.5% 
77.0% 

76.8% 
83.5% 

64.2% 
75.1% 



61.1 
82i 



1 .00 (referent) 
1.26 

1 .00 (referent) 

1.16 

1.07 

1.23 

1.61 

1 .00 (referent) 
1.35 

1 .00 (referent) 

1.40 

1.08 

1 .OO(referent) 
0.95 

1 .00 (referent) 
1.34 

1 .00 (referent) 
1.44 



RR, relative risk. 



for cohort studies and P = 0.576 for case-control 
studies). 

Discussion 

RCTs and observational studies generated different esti- 
mates of infection risk associated with GC therapy. The 
RCT meta-analysis suggested a null association between 
GC therapy and infection risk (RR, 0.97 (0.69, 1.36)). The 
confidence interval included both clinically meaningful 
increased risks (up to 35% increase) and decreased risks (up 
to a 30% reduction), making the result inconclusive. The 
observational studies provided an overall RR of 1.67 (1.49, 
1.87), suggesting a significant, clinically important increased 
risk. However, significant heterogeneity was found within 
the studies. Even after performing multiple sensitivity ana- 
lyses around exposure definition, outcome, and adjustment 
for confounders, marked heterogeneity remained a pro- 
blem. Nonetheless, most analyses of observational studies 
reported an increased risk of infection, which conflicts with 
the result of the RCTs. The dose of GC therapy varied both 
within and between RCTs and observational studies and 
may contribute to our observed result. However, we were 
able to perform meta-analyses within both study designs to 



investigate the risk associated with daily doses < 10 mg 
PEQ. The differential results between study designs 
remained. Although it is not yet clear to what extent the 
risk of infection is influenced by historic (or cumulative) 
GC therapy, patients in the observational studies are likely 
to have had longer cumulative exposure than are patients 
within the short-duration RCTs. This difference may go 
some way to explaining the apparent discrepancy in the 
results from the two study designs. 

Both study designs had major limitations when 
addressing infection risk. The big challenges in RCTs 
were poor reporting of methods and results and the sta- 
tistical challenge of rare outcomes. For observational 
studies, heterogeneity, lack of detailed reporting, con- 
founding, and bias (in particular publication bias) were 
particularly problematic. Other factors affecting the 
results and interpretation included variability of sam- 
pling frame, inclusion and exclusion criteria, definition 
of comparison groups, and time-varying GC exposure. 

Reporting of methods and results in RCTs 

GC exposure was usually well defined within RCTs. On 
occasions, additional GC therapy was allowed at the 
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Study 
ID 

Cohort studies 



Askling 

Bergstrom 

Bongartz 

Breedveld: all infections 

Carpenter 

Coyne 

Curtis 

Doran 

Edwards 

Favalli 

Fleischmann: serious infection 

Franklin 

Giles 

Grijalva: serious infection 

Huscher 

Jain 

Jenks 

Lacaille: serious infection (GC + no DMARDs) 

Luessenhop 

Malysheva 

McDonald 

Mertz 

Murata 

Saag 

Salliot 

Schnabel 

Schneeweiss 

Sihvonen 

Strangfeld 

Tanaka 

Wolfe 06: Pneumonia 
Wolfe 04 
Wolfe 06: Zoster 

Subtotal (l-squared = 71 .3%, p = 0.000) 

Case-control studies 

Bernatsky: serious infection 
Brassard 06 
Brassard 09 
Hamalainen 
Harigai 

Hernandez-Cruz 
Smitten 07: GPRD 
Smitten 07: PharMetrics 
Smitten 08 
Wilson 

Subtotal (l-squared = 79.4%, p = 0.000) 
Overall (l-squared = 76.3%, p = 0.000) 



GC safer 



GC harmful ES (95% CI) 



% 

Weight 



Si 



I — •- 



I ♦ 




0.90 
1.23 
1.28 
1.20 
1.42 
3.68 
1.46 
1.56 
2.94 
1.53 
2.48 
2.20 
0.54 
2.39 
0.66 
0.22 
3.50 
1.90 
1.20 
3.39 
1.41 
1.39 
1.90 
8.00 
1.42 
0.34 
2.14 
0.70 
1.86 
1.01 
1.70 
0.33 
1.50 
1.55 



2.56 
1.70 
2.40 
1.29 
3.76 
1.60 
1.46 
2.51 
1.92 
1.97 
1.95 



0.69 
0.36 
0.46 
0.69 
0.13 
1.86 
0.84 
1.20 
1.93 
0.69 
1 .60 
1.50 
0.13 
1.63 
0.34 
0.02 
0.73 
1.75 
0.39 
0.44 
1.19 
0.28 
1.00 
1.00 
1.04 
0.19 
1.50 
0.18 
0.92 
0.12 
1.50 
0.03 
1 .20 
1.35 



'2.29, 
1.30, 
1.10, 
0.79, 
1.37, 
0.87, 
1.24, 
2.05, 
1.67, 
0.88, 
1 .61 , 



1.17) 
4.20) 
3.56) 
2.08) 
15.64) 
7.31) 
2.54) 
2.03) 
4.48) 
3.39) 
3.84) 
3.23) 
2.22) 
3.50) 
1.25) 
1.91) 
16.72) 
2.06) 
3.69) 
26.12) 
1.67) 
6.83) 
3.61) 
64.00) 
1.96) 
0.60) 
3.05) 
2.68) 
3.76) 
8.51) 
1.93) 
3.16) 
1.87) 
1.79) 



2.86) 

2.22) 

5.24) 

2.13) 

10.32) 

2.94) 

1.72) 

3.07) 

2.21) 

4.38) 

2.36) 



4.01 
0.75 
1.02 
2.34 
0.22 
1.83 
2.35 
4.03 
3.04 
1.49 
2.95 
3.26 
0.59 
3.26 
1.97 
0.27 
0.49 
4.95 
0.88 
0.30 
4.58 
0.48 
1.98 
0.29 
3.66 
2.22 
3.43 
0.65 
1.76 
0.28 
4.79 
0.25 
4.27 
68.65 



4.85 
3.99 
1.53 
2.61 
1.04 
2.11 
4.61 
4.39 
4.73 
1.48 
31.35 



1.67(1.49,1.87) 100.00 



Figure 3 Meta-analysis of infection risk in observational studies, stratified by study design (1, cohort; 2, case-control) 



discretion of the treating physician, and this was rarely 
quantified. In contrast, safety outcomes from RCTs lacked 
any standardized reporting of methods or results. Methods 
sections at times omitted any mention of safety assessment 
[30,78] or were too vague to be helpful (for example, 
"records of ... adverse reactions... were kept") [79]. In the 
results sections, selective reporting was problematic and 
included reporting of only pre-selected events (for exam- 
ple, fractures and ophthalmologic complications [80]), 
events known to be associated with GC therapy [17], 



events occurring in more than two patients [29], or events 
leading to withdrawal). Reporting only events with a fre- 
quency beyond a certain threshold would miss rare events, 
potentially imbalanced across multiple studies. Withdrawal 
studies (in which reporting was complete) provided mea- 
sures of relative risk that could be included in the analysis. 
It is important that exclusion of these studies in a sensitiv- 
ity analysis did not change the overall results. Vague 
reporting was also common. Phrases such as "no meaning- 
ful toxicities were reported by the participants in either 
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RCTs 

Funnel plot with pseudo 95% confidence limits 



Observational studies 

Funnel plot with pseudo 95% confidence limits 
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Figure 4 Funnel plots of risk ratios in (a) RCTs and (b) observational studies, stratified by study design 



group" [81] or "the proportion of patients who reported 
adverse reactions [did not] differ between groups accord- 
ing to type of treatment" [79] did not provide sufficient 
information on infections to warrant inclusion. Reporting 
of symptoms rather than diagnoses meant we had to 
decide subjectively (but independendy) whether infections 
were present. We sought to include studies with an infec- 
tion incidence of zero, only if this was explicit or could be 
confidently inferred. Although this was ambiguous at 
times, the use of two independent reviewers made study 
selection more robust. 

Reporting of adverse drug reactions or side effects 
(with assumed causality) rather than all adverse events 
(in which causality is not assumed) was common. For a 
common event such as infection, causality is difficult to 
establish. Recent guidelines advise "terms that do not 
imply causality (such as 'adverse events') should be the 
default term to describe harms, unless causality is rea- 
sonably certain" [82]. 

Nonstandardized reporting in RCTs was a major pro- 
blem in collating information. Different definitions of 
infection meant that summary risk estimates were aver- 
aged across different outcomes. We attempted to perform 
sensitivity analyses limited to serious or nonserious infec- 
tions but were limited by low numbers. Underreporting of 
nonserious infections was likely: nonserious respiratory 
infections account for 300 to 400 general practice consul- 
tations annually per 1,000 registered patients in the United 
Kingdom [83]. Applying these rates to the RCTs, for 
example in the 2-year study of 192 patients by Wassenberg 
[29], we might expect > 100 nonserious infections. The 
reported number of infections was only seven. 



examine rare outcomes [11]. We used a variety of techni- 
ques including the Mantel-Haenszel odds ratio (with and 
without zero-cell correction), inverse variance, and weight- 
ing by study size to explore sensitivity to change. Although 
all methods failed to show a definite harmful or protective 
effect of GC therapy, all analyses included clinically impor- 
tant harms and benefits within the confidence intervals. 
GC therapy might be associated with a < 35% increased 
risk of infection, or a 30% reduction. Although GCs are 
widely thought to increase the risk of infection, it is plausi- 
ble that they might decrease the risk at these lower doses 
by controlling disease severity. The broad confidence 
intervals that span regions of clinically important effects in 
both directions are a consequence of low numbers of 
events, despite a meta-analysis of all existing studies. 

Inconsistent capture or reporting of infections has an 
impact on the weighting of studies within a meta-analysis. 
Fewer events within a study result in an increased variance 
and thus a lower weighting. We therefore applied alterna- 
tive weightings including total number of patients and 
estimated total person time, so studies with high numbers 
of patients but few infections would contribute more 
weight to the meta-analysis. For example, a 2-year study of 
250 patients with one discontinuation for infection [33] 
contributed only 2.7% weight to the original meta-analysis, 
but increased to 17.6% when weighted by numbers of 
patients or 23.2% by person-time. The absence of a signifi- 
cantly increased risk in these sensitivity analyses is reassur- 
ing, although again, we cannot conclude that GCs are not 
associated with an increased (or decreased) risk of infec- 
tion: the confidence intervals included up to a 70% 
increased or decreased risk, which is clinically meaningful. 



Rare events in RCTs 

Much debate has occurred about the analytic and metho- 
dologic challenges of conducting meta-analyses to 



Heterogeneity in observational studies 

Although RCTs have some heterogeneity, for example in 
background therapy or entry criteria, the variability in 
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observational studies is much wider. The observational 
studies reflected a wide range of settings and popula- 
tions, including year of recruitment, disease duration, 
disease severity, GC therapy practice, co-therapy, co- 
morbidity, geography, health-care systems, and recruit- 
ment methods (for example, single-center surgical 
experience, administrative database, biologies register). 
Each has its own implication for risk estimates, but the 
multiple domains of difference meant that much hetero- 
geneity existed within the studies. Even after stratifica- 
tion within any chosen domain, many differences 
remained in the other areas of potential heterogeneity, 
and the I values often remained high. Nonetheless, 
within this heterogeneity, the direction of effect typically 
suggested an increased risk associated with GC therapy, 
with only six of 42 studies reporting a relative risk of < 
1. Statistical heterogeneity thus likely arose from differ- 
ent effect sizes. 

It has been argued that meta-analysis of published 
nonexperimental data should be abandoned [84]. Others 
argue that careful consideration of sources of heteroge- 
neity within a systematic review can offer more insights 
than the "mechanistic calculation of an overall measure 
of effect, which will often be biased" [85]. We ran many 
stratified analyses to consider the impact of these possi- 
ble factors, producing some useful results, such as 
demonstrating a dose response. 

Lack of detailed reporting in observational studies 

Clear reporting of methods and results was a problem in 
observational studies as well as in RCTs, in particular, 
the definition of GC exposure and methods of risk attri- 
bution. This is important for GC therapy in RA because 
of its intermittent pattern of use and multiple routes of 
administration. GC therapy was rarely the primary expo- 
sure of interest in these observational studies, but 
merely one of many possible exposures or covariates, 
perhaps explaining the lack of detail. Methods sections 
rarely reported clearly on how GC exposure was cap- 
tured, although each study design provided certain 
opportunities for defining exposure. For example, in 
prescription databases, clinician reporting, or case note 
review without clarity about exposure, interpreting the 
many study results was challenging. Even when the 
source of exposure was clearly described, the definitions 
for "GC exposed" were rarely consistent. GC exposure 
was variously defined as ever exposed during the study 
period [37], exposed at study baseline [36], or recent 
[75] or current exposure [39] at the time of infection. 
Even within exposure categories, definitions varied. For 
example, current exposure at the time of infection 
included definitions of GC prescriptions within 30 days 
of the event, 45 days, and beyond. Risk windows used in 
the analyses included "on drug" [39,59], "on drug plus 



lag window" [68,71], and "ever exposed" [36,66]. Such 
analytic variability can produce different results even 
within one study [86]. Exploration of dose within obser- 
vational studies was restricted by reporting. We were 
able to explore a possible dose-response only in studies 
that stratified by dose. Variability in the time period was 
found when average dose was considered, similar to yes/ 
no definitions of exposure, adding additional heteroge- 
neity. Definition and sources of outcomes as well as 
methods of verification (when undertaken) also varied 
between studies. Sources of infection ranged from elec- 
tronic medical records, through case-note review or 
direct clinician reporting, to linkage with national inpati- 
ent registers. 

Several risk estimates had to be excluded because of 
problems with reporting, including typographic errors 
with point estimates outside of confidence intervals, and 
absent confidence intervals around reported point esti- 
mates [39,87]. Other studies reported average GC dose 
for cohorts of patients, but the absence of absolute 
patient numbers receiving GC therapy prevented 
inclusion. 

Confounding and bias in observational studies 

Confounding by disease severity, whereby patients with 
more-severe disease (and thus at a higher risk of infec- 
tion) are more likely to receive steroids, was a major 
concern. This potential bias is unavoidable in observa- 
tional drug studies. Confounding by contraindication 
was another possibility, in which patients with high 
comorbidity or frailty are considered too high risk for 
traditional DMARDs, and are instead treated with GCs. 
Within the meta-analysis, we stratified studies into 
those that reported unadjusted and adjusted risk esti- 
mates. Interestingly, the adjusted analyses provided a 
higher estimate of risk than did the unadjusted analyses, 
contrary to what we expected. If high disease severity 
and high comorbidity were reasons for receiving GC 
therapy (and both are independent risk factors for infec- 
tion), we would have expected the adjusted analyses to 
move toward the null. However, clinical decisions are 
complex, and more than these two variables are consid- 
ered, leaving the possibility of residual confounding. 

Publication bias is an important consideration, present 
at several levels. First, researchers who found a positive 
"statistically significant" association between GC therapy 
and infection risk may be more inclined to include this 
result in their article. Indeed, 23 of 42 observational stu- 
dies had statistically significant increased risks, with sev- 
eral just reaching the threshold of significance. 

Second, techniques such as forward or backward 
selection for multivariate analysis automatically reject 
nonsignificant results. If GC therapy was only one of 
many covariates of interest, it is plausible that only the 
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significant results were reported. We found examples of 
studies in which GC therapy was included in a multi- 
variate model, but no subsequent GC risk estimate was 
reported [88]. At times, it was explicitly reported that 
no association was found, but either no measure of 
effect was provided [89-93], or only a P value > 0.05 
was reported [94]. Exclusion of these null studies would 
result in a false inflation of the summary risk estimate 
and is a major concern. 

Third, having discovered a significant association, 
researchers may be more inclined to submit for 
publication. 

Fourth, reviewers may be more inclined to accept. Pub- 
lication bias means that the infection risk with GC ther- 
apy is likely to be less than the estimated RR of 1.67. 
Unfortunately, we cannot know how far correction for 
publication bias would move the result toward the null. 

Quality of included studies 

When combining multiple studies, we must consider not 
only the results from those individual studies, but also 
the quality of the studies. At present, no accepted 
instruments are available to assess the quality of studies 
that evaluate harms [82,95]. We did attempt to assess 
the included studies according to scales but found that 
the scores oversimplified the limitations, lacked discri- 
mination between studies, and missed other important 
factors. For example, the McHarm scale [96] scores 
reporting of both serious and severe harms as well as 
deaths. Very few of the observational studies had the 
primary aim of examining the safety of GC therapy, and 
thus did not consider severity or death. The Newcastle 
Ottawa Scale [97] includes a domain about comparabil- 
ity, or adjustment for confounders. The majority of stu- 
dies adjusted for confounders, yet wide variation existed 
in the covariates used. We have listed the confounders 
adjusted for within Table AF2 to provide the reader 
with study-specific details and performed sensitivity ana- 
lyses by using different adjustments. Ascertainment of 
exposure and outcome [97], as already discussed, was 
challenging to assess because GC therapy was only one 
of many covariates and often not the primary exposure 
of interest. Such lack of detail meant that we were lim- 
ited in generating a meaningful or accurate score. None- 
theless, no studies appeared to have different methods 
of ascertainment of the exposure/outcome for the cases 
and controls exposed and comparison cohorts. 

Conclusions 

Given these numerous problems with both study designs 
in assessing the infection risk with GC therapy, how can 
we best summarize? The interventional nature of RCTs 
provides an opportunity to isolate and examine the effect 
of therapy. To overcome the problem of small numbers 



of events in individual studies, meta-analysis can collate 
results and enhance this useful experimental study design 
to address safety. Multiple analytic models all reached 
the same broad estimate, providing reassurance. Unfortu- 
nately, all estimates were derived from the selected stu- 
dies after exclusion of studies of lower-quality methods 
and reporting. The results are valid only if the included 
studies were representative of all studies, and this is 
something we cannot assess. Of greater concern was the 
outcome ascertainment and reporting, which was gener- 
ally of poor quality. The clear variation in methods of 
ascertainment and reporting within our included studies, 
plus likely underreporting, leads to anxiety about the 
meta-analysis result. The observational studies are harder 
still to untangle. Many issues cloud the picture, in parti- 
cular methods for defining exposure and risk attribution, 
residual confounding, and publication bias. Replication of 
results does not allay these concerns. We must conclude 
that the risk of infection associated with systemic GC 
therapy in patients with RA remains uncertain, despite 
six decades of clinical experience. However, one consis- 
tent finding is that we cannot rule out the possibility of a 
clinically important increased risk, from either the RCTs 
or the observational studies. Improved, standardized 
reporting of harms [98] and improved access to patient- 
level, time-dependent data from RCTs would improve 
the ability to assess adequately the risks of specific 
adverse events. Within observational studies, clear defini- 
tions of drug exposure and risk attribution, as well as 
reporting of effect sizes, irrespective of statistical signifi- 
cance [99], would advance our knowledge. 
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